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Abstract: Compared with normal differentiated cells, cancer cells upregulate the expression of 
pyruvate kinase isozyme M2 (PKM2) to support glycolytic intermediates for anabolic processes, 
including the synthesis of nucleic acids, amino acids, and lipids. In this study, a combination of 
the structure-based pharmacophore modeling and a hybrid protocol of virtual screening methods 
comprised of pharmacophore model-based virtual screening, docking-based virtual screening, 
and in silico ADMET (absorption, distribution, metabolism, excretion and toxicity) analysis 
were used to retrieve novel PKM2 activators from commercially available chemical databases. 
Tetrahydroquinoline derivatives were identified as potential scaffolds of PKM2 activators. Thus, 
the hybrid virtual screening approach was applied to screen the focused tetrahydroquinoline 
derivatives embedded in the ZINC database. Six hit compounds were selected from the final hits 
and experimental studies were then performed. Compound 8 displayed a potent inhibitory effect 
on human lung cancer cells. Following treatment with Compound 8, cell viability, apoptosis, and 
reactive oxygen species (ROS) production were examined in A549 cells. Finally, we evaluated 
the effects of Compound 8 on mice xenograft tumor models in vivo. These results may provide 
important information for further research on novel PKM2 activators as antitumor agents. 
Keywords: pharmacophore, molecular docking, pyruvate kinase, virtual screening 

Introduction 

Altered metabolism is one the earliest observed differences between most adult healthy 
tissues and rapidly growing tissues, such as fetal tissues and tumors. Increased cell 
proliferation, which characterizes tumor growth, increases the need for nutrients and 
maintains a balance between the utilization of nutrients for energy production and ana- 
bolic processes. 1 Cancer cells have a higher nutrient uptake to support the production 
of new cells and to satisfy the increased biosynthetic demand compared with normal 
tissues, which require a lower nutrient uptake and use most of the available nutrients 
for energy production rather than for protein, lipid, and nucleic acid syntheses. 2 " 4 In the 
1920s, Warburg reported that cancer cells have increased rates of lactate production 
even in the presence of oxygen. 5 This effect is known as aerobic glycolysis or the 
Warburg effect and might be due to the reprogramming of metabolic genes to allow 
cancer cells to achieve the nutrient requirements for cell proliferation. 5 Recently, 
the Warburg effect has been explored for the common clinical detection of tumors 
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via fluorodeoxyglucose positron emission tomography. 
Although less efficient in ATP production, aerobic glycolysis 
is commonly enhanced in cancer cells through deregulation 
pathways, including the activation of the PI3K-AKT-mTOR 
pathway and the upregulation of hypoxia-inducible factor- la 
(HIF-la) and c-Myc. 6-8 Several metabolic enzymes, such as 
hexokinase 2 (HK2), lactate dehydrogenase A, and pyruvate 
dehydrogenase kinase 1 , are direct targets of oncogenic tran- 
scription factors, such as HIF-la and c-Myc. 9 

The delicate balance required for tumor cells to endure 
unfavorable conditions, such as hypoxia and low glucose sup- 
ply, remains unclear. Pyruvate kinase catalyzes the last step 
in glycolysis to convert the substrate phosphoenolpyruvate 
into pyruvate and produce one molecule of ATP per reac- 
tion per cycle. 10 Takenaka et al discovered the alternative 
RNA splicing that yields the Ml and M2 forms of pyruvate 
kinase isozyme (PKM1 and PKM2). 11 PKM2 is extensively 
expressed in undifferentiated embryonic tissues, and there 
is a switch to express PKM1 during the development of 
many differentiated tissues. Most cancer cell lines have been 
studied to determine the exclusive expression of PKM2. 12 
PKM2 has been found in some normal tissues that require 
high rates of nucleic acid synthesis. The downregulation 
of PKM2 activity provides a purposeful divergence from 
catabolic metabolism toward an anabolic state to provide 
the necessary resources for rapid cell proliferation. 13 PKM2 
exists in equilibrium between a low activity T- state and a 
high activity R- state. The active forms of PKM2 are tetra- 
meric and allosterically activated by FBP through binding 
at the flexible loop region near the dimer-dimer interface of 
the tetramer. 1 ^ 16 Small molecule PKM2 activators stabilize 
the tetramer form, affect cancer metabolism, and provide 
a novel antitumor therapeutic method. Recently, Thomas 
et al reported the first small molecule PKM2 activators, 
which include a series of diarylsulfoamides (Figure 1). 15-17 
Additional PKM2 activator chemotypes, such as tetrahydro- 
quinolines, have been reported since then. 1819 A relatively 
different structural class with pyrido[l,2-a]pyrimidin 
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moieties was identified in a recent study through an in silico 
high-throughput screen (Figure l). 20 Computational research 
should be done to develop a possible antitumor agent design 
and to further explore the binding mode between the synthe- 
sized ligands and PKM2. 

Our ongoing research aimed to search specific PKM2 
activators 21 and to explore techniques to generate more 
accurate and reasonable structure-based computer-aided drug 
design methods. 22 29 Structure-based pharmacophore (SBP) 
design and hybrid protocol of virtual screening can be used to 
detect novel tetrahydroquinoline-based lead compounds based 
on changes in the chemical scaffold and can provide useful 
references for the design and preliminary evaluation of PKM2 
activators. The combination of SBP models and subsequent 
drug design strategies can provide insights into the required 
interaction patterns for PKM2 protein allosteric regulation. 
Moreover, the developed SBP model can be used for vir- 
tual screening to discover novel potential lead compounds. 
These results may provide significant information for further 
research on novel PKM2 activators as antitumor agents. 

Materials and methods 

Generation of structure-based 
pharmacophore models 

A set of seven crystal structures of PKM2 in complex with 
diverse ligands (Table 1) were obtained from the Protein Data 
Bank (PDB). 30 The crystal structures with FBP, the natural 
ligand of PKM2, were not used in the analyses in order to 
avoid the unnecessary noise likely to be introduced into the 
pharmacophore model. 31 The generation procedures of the 
structure-based pharmacophore models were referenced to 
our previous reports. 22 26 The whole process of generation 
and utilization of the structure-based pharmacophore models 
were illustrated in Figure 2 and detailed as follows. Briefly, 
the coordinates of all PKM2-ligand X-ray crystal structures 
were transformed into a common reference frame by using 
"Multiple Structure Alignment (Modeller)" module within 
Discovery Studio (DS). 32 The complex-based pharmacophore 
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Figure I Representative chemical structures of previously reported hPKM2 activators. 

Abbreviations: PKM, pyruvate kinase isozyme; AC 50 , activation concentration 50%; hPKM2, human PKM2. 
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Table I Analyses of critical amino acids and pharmacophore features from seven PKM2-activators' co-crystal structures deposited 
in the Protein Data Bank 



PDB Number 3GR4 



3GQY 



3U2Z 



3H60 



3ME3 



4GIN 



4JPG 



Resolution 1.60 A 

Ligand DYY 
Release date April 7, 2009 

Interception residue 

A: Phe26 
A: Leu27 

A: Met30 

A: Leu353 



A: Tyr390 
A: Gln393 
A: Leu394 
A: Glu397 
B: Phe26 

B: Met30 



1.85 A 
DZG 

April 7, 2009 

A: Phe26 
A: Leu27 

A: Met30 
A: Asn3l8 
A: Leu353 
A: Asp354 

A: Tyr390 
A: Gln393 
A: Leu394 
A: Glu397 
B: Phe26 
B: Leu27 
B: Met30 



Donor I 
Donor 2 
Acceptor I 
Acceptor 2 



2.10 A 2.00 A 

07T D8G 
December 9, 20 1 2 May 5, 2009 



A: Phe26 
A: Leu27 

A: Met30 



A: Tyr390 
A: Gln393 
A: Leu394 

B: Phe26 





B: Asn3l8 








B: Leu353 


B: Leu353 






B: Asp354 


B: Asp3S4 


B: Asp354 




B: Tyr390 


B: Tyr390 


B: Tyr390 




B: Gln393 


B: Gln393 


B: Gln393 




B: Leu394 


B: Leu394 


B: Leu394 




B: Glu397 


B: Glu397 


B: Glu397 


Pharmacophore 


model features 






Arl 


V 






Ar2 


V 






Ar3 








Ar4 








HPI 


V 






HP2 


V 






HP3 









A: Phe26 



1.95 A 
3SZ 

April 28, 2010 
A: Phe26 



A: Asn3l8 



2.30 A 
NZT 

October 10,2012 
A: Phe26 



A: Met30 



A: Leu353 


A: Leu353 


A: Leu353 


- 


A: Asp354 


A: Asp354 






A: Ile389 


A: Tyr390 


A: Tyr390 


A: Tyr390 


A: Gln393 


A: Gln393 


A: Gln393 


A: Leu 3 94 


A: Leu 3 94 


A: Leu 3 94 


A: Glu397 


A: Glu397 


A: Glu397 


B: Phe26 


B: Phe26 


B: Phe26 


B: Leu27 


B: Leu27 


B: Leu27 




B: Met30 


B: Met30 






B: Lys3 1 1 




B: Leu353 


B: Leu353 




B: Asp354 


B: Asp354 


B: Tyr390 


B: Tyr390 


B: Tyr390 


B: Gln393 


B: Gln393 


B: Gln393 


B: Leu394 


B: Leu394 


B: Leu394 




B: Glu397 




V 


V 


V 


V 


V 








V 


V 








V 






V 





2.33 A 
I OX 

May 22, 2013 
A: Phe26 
A: His29 

A: Leu353 

A: Ile389 
A: Tyr390 

B: Phe26 
B: Lys3 1 I 



B: Gln393 



Abbreviations: PDB, Protein Data Bank; Ar, aromatic rings; HP, hydrophobic features; PKM, pyruvate kinase isozyme. 



generation module embedded in DS was used to generate 
seven individual complex-based pharmacophore models 
based on the previously aligned structures. All pharmacoph- 
ore features identified by DS were clustered according to 
their interaction pattern with the PKM2 protein. The obtained 
model was further refined by the modification of the constraint 
tolerance of the spheres in accordance with the default values 
of pharmacophore modules embedded in DS. 

Pharmacophore-based virtual screening 

The ligand-based procedure was applied to the ZINC 
database supplied via the worldwide web, which contained 



about 35 million commercially available compounds. 33 34 
The ZINC database provides files corresponding to 
substances distributed from different vendors, and the entire 
database was processed by using the SMILES compounds' 
codes. Among this large set of compounds the tetrahydro- 
quinoline scaffold was found only in approximately 12,000 
compounds, which were further investigated. In the present 
study, structure-based pharmacophore model comprising 
of six chemical features was used as a query for searching 
tetrahydro-quinoline based chemical library. We performed 
all database searching experiments using the "Best" confor- 
mation generation option and the "Flexible" fitting method 
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Figure 2 Virtual screening protocol for the identification of novel PKM2 activators. 

Abbreviations: PKM, pyruvate kinase isozyme; ADMET, absorption, distribution, metabolism, excretion and toxicity; PBVS, pharmacophore based virtual screening; 
DBVS, docking based virtual screening. 



option in the "pharmacophore" protocols of DS software. 35 37 
The molecules in the tetrahydro-quinoline based chemical 
library which fit with all the features of the pharmacophore 
model were retained as hits. The criterion for screening for 
further validation was high fit values, which indicate good 
matches. 

Molecular docking study 

Molecular docking studies were performed to generate the 
bioactive binding poses of PKM2 activators in the active site 
of proteins by using the LibDock module in DS software. 
LibDock uses protein site features, referred to as hot spots, 
consisting of two types (polar and apolar). In the current 
study, the CHARMm Force Field was used for energy mini- 
mization of the ligands. The binding sphere was primarily 
defined as all residues of the target within 10 A from the 
ligand binding site. Conformer Algorithm based on Energy 
Screening and Recursive build-up (CAESAR) was used for 
generating conformations. Then, the smart minimizer was 
used for in situ ligand minimization. All other parameters 
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used were utilized default settings. To better understand the 
interactions between protein residues and bound ligands, the 
Ligplot program 38 was used for generating the 2D diagrams 
that helped to identify the binding site residue, including 
amino acid residues, waters, and ligand compounds. 

Fingerprint based similarity analysis 

The chemical similarity search was conducted with DS 
using the standard protocol. The MACCS structural keys 
were utilized with the Tanimoto Coefficient (Tc) as the 
similarity metric. 39 

In silico ADMET analysis 

ADMET refers to the absorption, distribution, metabolism, 
excretion, and toxicity properties of a molecule within an 
organism, which were predicted using the ADMET module 
and the TOPKAT module embedded in DS software. 40 In the 
ADMET module, six mathematical models (aqueous solubil- 
ity, blood-brain barrier penetration, CYP2D6 inhibition, hepa- 
totoxicity, human intestinal absorption, and plasma protein 
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binding) were used to quantitatively predict properties of a set 
of rules that specify ADMET characteristics of the chemical 
structure of the molecules. The hit compounds possessing 
satisfactory ADMET properties were chosen according to 
TOPKAT calculation. The TOPKAT predictions help in opti- 
mizing therapeutic ratios of lead compounds for further devel- 
opment and assessing their potential safely concerns. They will 
help in evaluating intermediates, metabolites, and pollutants, 
along with setting dose range for animal assays. 36 

In vitro pyruvate kinase assay 

An in vitro pyruvate kinase enzymatic assay was performed 
using the Kinase-Glo Plus Assay kit (Promega Corpora- 
tion, Fitchburg, WI, USA), using methods referred to in 
the literature. 1518 All reactions were carried out at room 
temperature for 1 hour in a 50 |lL mixture containing PK 
assay buffer, 0.1 mM ADP (Sigma- Aldrich Co., St Louis, 
MO, USA), 0.5 mM PEP (Sigma-Aldrich Co.), 0.9 nM 
PK enzymes (BPS Bioscience, San Diego, CA, USA), and 
the test compounds. FBP (Sigma-Aldrich Co.) was used 
as a positive control for activation at concentration of 0.5 
mM. At the end of enzymatic reactions, 50 jlL of Glo Plus 
reagent was added to each well and luminescence was 
measured using a BioTek Synergy™ microplate reader. 
In the absence of the compound, the luminescence was 
defined as 0% activity. The percent activity in the pres- 
ence of each compound was calculated according to the 
following equation: 

% activity = [(L - L b )/(L t - L b ) - 1] xl00%, (1) 

where L = the luminescence in the presence of the compound, 
L, = the luminescence in the absence of PKM2, and L + = 

b 7 t 

the luminescence intensity in the absence of the compound. 
The values of % activity versus a series of compound 
concentrations were plotted using non-linear regression 
analysis of sigmoidal dose-response curve generated with 
the equation: 

Y = B + (T - B)/l + 10((logEC 5 -X) x hill slope), (2) 

where Y = percent activity, B = minimum percent activity, 
T = maximum percent activity, EC 50 = effective concentration 
50%, X = logarithm of compound, and Hill Slope = slope 
factor or Hill coefficient. 

Cell lines and animals 

A549 (human lung cancer cell line) was obtained from 
the American Type Culture Collection (ATCC; Manassas, 
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VA, USA). Cells were propagated in RPMI 1,640 media 
containing Gibco 10% fetal bovine serum (FBS; Thermo Fisher 
Scientific, Waltham, MA, USA) and 1% antibiotics (penicillin 
and streptomycin) in 5% C0 2 at 37°C. Female BALB/c mice 
used in this study were obtained from Beijing HFK bioscience 
CO., Ltd. (Beijing, People's Republic of China). 

Cell proliferation assay 

Cells (3-5x1 OVwell) were seeded in 96-well culture plates. 
After 24 hours of incubation, the cells were treated with 
vehicle (0.1% DMSO [dimethyl sulfoxide]) and various 
concentrations of Compound 8 for 72 hours in normoxia 
(21% 0 2 ) or hypoxia (1% 0 2 ), respectively. Then, 20 |lL of 
5 mg/mL MTT was added to each well and the plates were 
incubated for an additional 2 hours at 37°C. The medium was 
subsequently discarded, and 150 |lL DMSO was added to 
dissolve the formazan. Absorbance was measured at 570 nM 
using a Spectra MAX M5 microplate spectrophotometer 
(Molecular Devices LLC, Sunnyvale, CA, USA) and 
the IC 50 (inhibitory concentration 50%) values were 
calculated. 

Colony formation assay 

Briefly, A549 cells (-400-500 cells/well) were seeded in 
a 6- well plate. After 24 hours of incubation, the cells were 
treated with various concentrations of Compound 8 and then 
cultured for another 10 days in hypoxia (1% 0 2 ), respectively. 
After washing by phosphate buffer saline (PBS), colonies 
were fixed with 4% paraformaldehyde and stained with a 
0.5% crystal violet solution. Finally, the colonies with >50 
cells were counted under an inverted microscope. 

Morphological analysis by EdU staining 

Briefly, A549 cells were plated in a 6-well plate for 24 hours. 
The cells were treated with Compound 8 for another 48 hours 
in normoxia (21% 0 2 ) or hypoxia (1% 0 2 ) and then washed 
with cold PBS. Finally, cells were labeled with 30 mmol/L 
5-ethenyl-20-deoxyuridine (EdU; Invitrogen; Thermo Fisher 
Scientific) solutions according to the manufacturer's instruc- 
tions. Then nuclear morphology of cells was examined under 
an inverted fluorescence microscopy (Zeiss Axiovert 200; 
Zeiss, Jena, Germany). 

Apoptosis analysis by flow cytometry (FCM) 

After 1 5 jlM Compound 8 treatment for 48 hours in normoxia 
(21% 0 2 ) or hypoxia (1% 0 2 ) the cells were harvested and 
washed with cold PBS twice. Then the level of apoptosis 
was determined using the apoptosis detection kit according 
to manufacturer's instructions. 
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Subcutaneous tumor mouse model 

To investigate the anti-tumor activity of Compound 8, a 
subcutaneous A549 tumor mouse model was used. 100 |iL 
of A549 cell suspension (5xl0 5 cells) were injected subcu- 
taneously into the right flank of mice at day 0, and when the 
tumors were palpable at day 4, mice were randomized into 
four groups (12 mice per group). At day 4, 9, and 14, mice 
were injected intravenously with 100 |lL of NS (control), low 
dose Compound 8 (25 mg/kg), or high dose Compound 8 
(75 mg/kg), respectively. Mice in each group were used for 
tumor growth inhibition assay, and tumor size was measured 
every three days using calipers. Tumor volume was calcu- 
lated by the equation: 

Vol = 0.52 x (a x b 2 ), (3) 

where vol is tumor volume, a is the length of the major axis, 
and b is the length of the minor axis. 

Results and discussion 

SBP models 

In our previous reports, the SBP model of PKM2 was gen- 
erated using six PKM2 activator co-crystal complexes, and 
pharmacophore-based QSAR was performed for a series of 
Aryl- sulfonamide derivatives. Recently, a novel series of 
pyrido[l,2-a]pyrimidin derivatives as PKM2 activators and 
the corresponding PKM2 co-crystal complexes were resolved 
simultaneously. In this study, the seven X-ray crystallogra- 
phy structures of PKM2 in complex with small molecular 
activators were used to construct the SBP. The results of 
molecular superposition based on Modeller module in the 



DS software were reported (see Figure SI). The detected 
pharmacophore features, which determine the amount of 
complexes in a given feature, are shown in Table 1. The 
eleven pharmacophore features included two hydrogen bond 
acceptors (A1-A2), two hydrogen bond donors (D1-D2), 
three hydrophobic features (HP1-HP3), and four aromatic 
rings (Arl-Ar4). In these features, three (Arl, Ar2, and 
HP1) were found as common in the seven complexes. The 
comprehensive pharmacophore map and the ligand-binding 
conformation at the binding site of PKM2 are shown in 
Figure 3 A. The obtained comprehensive pharmacophore 
map was initially too restrictive and inappropriate for virtual 
screening because of the large number of chemical features, 
and the fit of a molecule to such a pharmacophore was still 
unattainable for today's state of the art computational tools. 41 
A correctly reduced pharmacophore model is more preferable 
for practical applications. 31 According to our previous results 
and experience, the top-ranked three features (Arl, Ar2, and 
HP1) were selected from the comprehensive pharmacophore 
map and merged to generate the SBP (Figure 3B). 

Virtual screening 

Virtual screening of chemical libraries may facilitate the 
search for novel lead compounds suitable for further research. 
Compared with other de novo design methods, virtual screen- 
ing can easily detect the desired compounds from commercial 
sources for biological activity assay. 25 In the present study, we 
performed the focused tetrahydroquinoline library search by 
using the Best and Flexible search options. Virtual screening 
through SBP was performed to assess the commercially avail- 
able focused tetrahydroquinoline library (1 1 ,890 compounds). 




Figure 3 Comprehensive pharmacophore features (A) and the common structure-based pharmacophore model (B) generated by seven PKM2-activator complexes. 
Note: The green boxes indicate the key pharmacophore features. 

Abbreviations: PKM, pyruvate kinase isozyme; Ar, aromatic rings; HP, hydrophobic features. 
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A hit list of 4,572 compounds that matched the pharmacophore 
model was obtained. Moreover, the docking program produced 
several positions with different orientations within the binding 
site. Each position produced a different LibDock scores. The 
best score was considered for further studies. The compounds 
bearing > 100 LibDock scores were considered as active new 
hits and were used to sample a sufficient chemical space and 
to increase the hit rate. A set of 286 hits satisfied the speci- 
fied cut-off score and were then used for further evaluations. 
Compounds with similar structures were rejected for retrieving 
new scaffolds. Finally, a small set of 10 hits with favorable 
drug-like properties, high docking score, and preferred dock- 
ing position were obtained (Table 2). A representative com- 
pound with the highest docking score and fitness values, and 
its interactions with the binding site are shown in Figure S2. 
The analysis of the protein-ligand complexes revealed the 
binding site residue, including amino acid residues, water, 
and ligands. A 2D diagram showing various interactions, such 
as hydrogen bonds, atomic charge interactions, and Pi-sigma 
interactions between the surrounding residues and the ligand, 
was also presented. The high structural diversity of the final 
hits demonstrated that hits with similar chemical features and 
novel scaffolds could be retrieved. The pairwise similarity was 
assessed by Tanimoto coefficients (Tc) 42 between VS hits and 
each hit's nearest neighbor from the active compounds in the 
known compounds set. The majority of these virtual hits were 
found to be less similar to the compounds already known, so 
it might be interesting to test these hits experimentally. To 
verify the diversity of those virtual hits, pairwise similarity 
calculations were performed. 

In silico ADMET analysis 

The docking results suggested that the ten hits had a good 
docking score (Table 2). The docking studies were promis- 
ing; thus, the chemical descriptors for the pharmacokinetic 
properties were calculated. Pharmacokinetic properties, 
including aqueous solubility, blood-brain barrier penetration, 
CYP2D6 binding, hepatotoxicity, intestinal absorption, and 
plasma-protein binding, were determined. The calculation 
was designated as the first step toward analyzing the novel 
chemical entities and checking the failure of lead candidates. 
The failed lead candidates might cause toxicity or might be 
metabolized by the body into an inactive form or in a form 
without the ability to cross membranes. The results of this 
analysis are presented in Figure 4. The biplot figure shows 
the two analogous 95% and 99% confidence ellipses for 
the blood-brain barrier penetration and human intestinal 
absorption models, respectively. The detailed results of 



pharmacokinetic properties and toxicity analysis for the six 
selected compounds are shown in Table 3. The solubility 
and partition coefficient were calculated to determine phar- 
macokinetic properties, whereas the toxicity, mutagenicity, 
tumorigenicity, irritation effect, and risk of reproductive 
effect were predicted. The logP was predicted to determine 
the hydrophilicity of both compounds. A high logP was 
associated with poor absorption or permeation; thus, the 
value should be less than 5. The results suggested that the 
six compounds were within this limit, and Compound 6 had 
the best logP. Likewise, skin irritancy was not observed in 
all compounds except in Compound 9, which showed mild 
irritancy. Other properties such as rat oral LD 50 (lethal dose 
50%), Ames mutagenicity, developmental toxicity poten- 
tial, rat inhalational LC 50 (lethal concentration 50%), rat 
maximum tolerated dose, fathead minnow LC 50 , and aerobic 
biodegradability are also provided in Table 4. These results 
further demonstrate the need to discover new PKM2 activa- 
tors for cancer treatments. 

In vitro pyruvate kinase assay 

An in vitro pyruvate kinase enzymatic assay was performed. 
Hit Compound 8, which was identified from the virtual screen, 
possessed the highest AC 50 (activation concentration 50%) 
value versus 0.111 |iM PKM2. However, Compounds 3 and 
9 can activate PKM2 with AC 50 values of 8.47 and 2.38 |iM, 
respectively. Compounds 3,8, and 9 had a similar scaffold, 
and the main difference between these compounds was the 
fusion ring at the 3,4 position of the tetrahydroquinoline 
scaffold. The other three compounds activated the PKM2 
with AC 50 values from 0.469 |lM to 0.966 |lM. This find- 
ing suggests that the tetrahydroquinoline core structure is a 
potent moiety of hit compounds to interact with PKM2. Thus, 
modifying the core structure might result in determining more 
potent PKM2 activators. 

The anti- proliferative effects 
of Compound 8 

We investigated the proliferation inhibition caused by Com- 
pound 8 treatment on A549 human lung cancer cells. After 
exposure to Compound 8 for 72 hours, only a few decreases 
in the cellular viability were observed. Moreover, exposure 
of A549 cells to hypoxia, or hypoxia combined with Com- 
pound 8, for 24 hours, 48 hours and 72 hours, respectively, 
caused a marked decrease in the viability (Figure 5A). These 
results demonstrated that Compound 8 inhibited the prolifera- 
tion of A549 cells in a hypoxic environment in a time- and 
concentration-dependent manner. 
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ADM ET_PS A_2 D 

Figure 4 Plot of PSA versus AlogP for candidate compounds showing the 95% and 
99% confidence limit ellipses corresponding to the blood-brain barrier and intestinal 
absorption models. 

Note: The numbers correspond to the compounds in Table 2. 
Abbreviations: ADMET, absorption, distribution, metabolism, excretion and 
toxicity; AlogP, the logarithm of the partition coefficient between n-octanol and 
water; BBB, blood-brain barrier; PSA, polar surface area; 2D, two-dimensional. 

Moreover, to further determine whether Compound 8 
could inhibit the proliferation of A549 cells, we conducted 
clonogenic assay under hypoxia. A clonogenic assay 
clearly showed that Compound 8 treatment significantly 
decreased the colony size and number of colonies in a 
concentration-dependent manner (Figure 5B, C). The results 
from the clonogenic assay were in accordance with the MTT 



assay. Taken together, these results indicated that A549 
could inhibit the proliferation of A549 cells in a hypoxic 
environment. 

Morphological changes and the proliferation capacity of 
A549 cells treated by Compound 8 were investigated through 
EdU staining (Figure 6). In the A549 cells, the combined 
treatment of compound 8 and hypoxia resulting high bright- 
blue fluorescent (DAPI stained) and low red fluorescent 
(EdU stained), and the difference between each groups 
suggested that Compound 8 inhibited the proliferation of 
A549 cells in a hypoxic environment. 

Inducing apoptosis by 
Compound 8 under hypoxia 

To investigate the underlying mechanism of anti-proliferative 
activity, DCFH-DA (2 / ,7 / -Dichlorofluorescin diacetate) was 
applied to detect the change of intracellular reactive oxygen 
species (ROS) level after Compound 8 treatment. As com- 
pared with control cells, Compound 8 combined with hypoxia 
caused a marked increase of ROS levels in A549 cells, enhanc- 
ing ROS to almost triple these levels (Figure 7A). 

To determine whether hypoxia played an important role 
in the induction of apoptosis following Compound 8 treat- 
ment in A549 cells, an Annexin V assay was conducted. 
After a 48 hours treatment of hypoxia with or without the 



Table 3 Compliance of compounds to ADMET prediction and drug likeness properties 


Compound 


Aqueous 


Blood-brain 


CYP2D6 binding 


Hepatotoxicity 


Intestinal 


Plasma 


PSA 


AlogP98 


name 


solubility 


barrier 
penetration 






absorption 


protein 
binding 






I 


2 


4 


False 


True 


I 


True (highly 


39.04 1 


5.035 




(low) 


(undefined) 


(non-inhibitor) 


(toxic) 


(moderate) 


bounded) 






2 


2 


2 


False 


False 


0 


True (highly 


47.97 1 


5.0 1 8 




(low) 


(medium) 


(non-inhibitor) 


(nontoxic) 


(good) 


bounded) 






3 


2 


3 


False 


True 


0 


True (highly 


54. 1 82 


4.2 1 




(low) 


(low) 


(non-inhibitor) 


(toxic) 


(good) 


bounded) 






4 


2 


4 


False 


False 


I 


True (highly 


90.455 


4.6 1 I 




(low) 


(undefined) 


(non-inhibitor) 


(nontoxic) 


(moderate) 


bounded) 






5 


2 


4 


False 


False 


I 


True (highly 


60.222 


3.067 




(low) 


(undefined) 


(non-inhibitor) 


(nontoxic) 


(moderate) 


bounded) 






6 


2 


3 


False 


False 


I 


True (highly 


48.523 


6.232 




(low) 


(low) 


(non-inhibitor) 


(nontoxic) 


(moderate) 


bounded) 






7 


2 


3 


False 


False 


0 


True (highly 


60.476 


5.592 




(low) 


(low) 


(non-inhibitor) 


(nontoxic) 


(good) 


bounded) 






8 


2 


4 


False 
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2 


True (highly 


65.83 1 


3.896 




(low) 


(undefined) 
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(poor) 
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9 


2 


3 
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0 


True (highly 


5I.85I 
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Abbreviations: AlogP, the logarithm of the partition coefficient between n-octanol and water; PSA, polar surface area; ADMET, absorption, distribution, metabolism, 
excretion and toxicity. 
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Table 4 In silico screening of hit compounds for toxicity risk assessment 



Compound 


1 


2 


3 


5 


o 
o 




Rat oral LD 50 


0.689 


1.6 


0.1 14 


0.59 


0.314 


1.12 


(g/kg body weight) 














Rat inhalational LC 50 (mg/m 3 /h) 


8.876 


25.4 


10.0 


8.05 


9.28 


10.0 


Rat maximum tolerated dose 


0.512 


10.0 


2.4 


3.2 


0.749 


1.6 


(g/kg body weight) 














Developmental toxicity potential 


Toxic 


Nontoxic 


Toxic 


Toxic 


Toxic 


Nontoxic 


US FDA rodent carcinogenicity 














Mouse female 


Noncarcinogen 


Noncarcinogen 


Noncarcinogen 


Noncarcinogen 


Noncarcinogen 


Noncarcinogen 


Mouse male 


Noncarcinogen 


Noncarcinogen 


Noncarcinogen 


Noncarcinogen 


Noncarcinogen 


Noncarcinogen 


Rat female 


Noncarcinogen 


Noncarcinogen 


Noncarcinogen 


Noncarcinogen 


Noncarcinogen 


Noncarcinogen 


Rat male 


Noncarcinogen 


Noncarcinogen 


Noncarcinogen 


Noncarcinogen 


Noncarcinogen 


Noncarcinogen 


Ames mutagenicity 


N on mutagen 


Nonmutagen 


Nonmutagen 


Nonmutagen 


Nonmutagen 


Nonmutagen 


Daphnia EC 50 (mg/L) 


0.85 


8.6 


3.8 


1.9 


0.53 


4.23 


Skin sensitization 


Strong 


Strong 


Strong 


Strong 


Strong 


Strong 


Rat chronic LOAEL 


4.23 


61.0 


1 .5 


1 .4 


2.3 


1 7.7 


(g/kg body weight) 














Fathead minnow LC 50 (g/L) 


I.32XI0 4 


I.08xl0" 6 


2.56x1 0 5 


I.63xl0 6 


8.55x1 0 4 


3.47x1 0 5 


Aerobic biodegradability 


Nondegradable 


Nondegradable 


Nondegradable 


Nondegradable 


Nondegradable 


Nondegradable 


Ocular irritancy 


Mild 


Mild 


Mild 


Mild 


Mild 


Mild 


Skin irritancy 


None 


None 


None 


None 


None 


Mild 



Abbreviations: EC S0 , effective concentration 50%; FDA, Food and Drug Administration; LC S0 , lethal concentration 50%; LD 50 , lethal dose 50%; LOAEL, lowest observed 
adverse effect level. 
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Figure 5 The effects of Compound 8 on A549 lung cancer cell viability. 

Notes: (A) MTT assays showed Compound 8 inhibited A549 cell proliferation in a hypoxic environment in a concentration- and time-dependent manner. Data are 
expressed as means ± standard deviation from three independent experiments. (B) The colony clusters were detected by crystal violet staining, after a 1 0-day Compound 
8 treatment in a hypoxic environment. (C) Statistical results of colony-forming assays presented as surviving colonies. Data are expressed as means ± standard deviation from 
three independent experiments (*P<0.05; **P<0.0l). 
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Figure 6 The fluorescent thymidine analog EdU was used to identify S-phase cells by labeling their replicating DNA (red), nuclei are colabeled with DAPI (blue). 
Notes: (A-C) Blank vehicle under normoxia (21% 0 2 ); (D-F) blank vehicle under hypoxia (1% 0 2 ); (G-l) compound 8 treatment under hypoxia (1% 0 2 ). 
Abbreviations: EdU, 5-ethenyl-20-deoxyuridine; DAPI, 4',6-diamidino-2-phenylindole. 



A C 




Annexin V 



Figure 7 (A) Compound 8 induced apoptosis associated with ROS production. The levels of ROS were measured by DCF fluorescence. Data are expressed as means ± 
standard deviation from three independent experiments (**P<0.0l). (B) Flow cytometric analysis of cells stained with Annexin V-FITC/PI after treatment with 10 (iM 
Compound 8 combined with hypoxia, or hypoxia only. (C) Statistical results of apoptosis assays. Data are expressed as means ± standard deviation from three independent 
experiments (**P<0.0l). 

Abbreviations: PI, propidium iodide; FITC, fluorescein isothiocyanate; ROS, reactive oxygen species; DCF, 2 / ,7 / -Dichlorofluorescein. 
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Figure 8 Compound 8 suppressed tumor growth in the subcutaneous A549 tumor model. 

Notes: (A) Tumor growth curve in each group. (B) Body weight of experimental animals in each group. 



combination of Compound 8 (20 |lM), the level of apoptosis 
was determined by FCM. The apoptosis rates in A549 cells of 
control and Compound 8 combined hypoxia treatment were 
1 .3% and 57.4%, respectively (Figure 7B, C). The data indi- 
cated that treatment with Compound 8 significantly increased 
the rate of apoptosis of A549 cells in a hypoxic environment 
compared with the controls, which were consistent with 
the results of Hoechst staining. Moreover, single treatment 
with hypoxia could significantly reduce the rate of apoptosis 
(Figure 7B, C). Taken together, these data indicated that the 
hypoxic environment is critical for the observed Compound 
8-induced apoptosis. 

In vivo antitumor effect in subcutaneous 
tumor model 

To test the antitumor effect of Compound 8 in vivo, female 
BALB/c mice were inoculated subcutaneously with A549 
cells. Mice then received an intraperitoneal injection (ip) of 
Compound 8 (25 mg/kg and 75 mg/kg, respectively) or vehicle 
every 3 days for 21 days. The results showed that the tumor 
growth of the Compound 8 treatment group become signifi- 
cant slowed after 6 days treatment. Compound 8 substantially 
inhibited tumor growth in a dose-dependent manner compared 
with the control after 2 1 days treatment (Figure 8A). Moreover, 
Compound 8 treatment was well tolerated and did not cause 
significant loss in body weight compared with the vehicle group 
(Figure 8B). The above results demonstrated that Compound 
8 could efficiently suppress tumor growth in vivo. 

Conclusion 

In this study, we attempted to obtain PKM2 activators with 
novel scaffolds via the combined pharmacophore-based 



virtual screening with molecular docking and in silico 
ADMET analysis. The SBP model was established using 
the seven co-crystal complexes of PKM2 and activators. 
The model was further used as a 3D search query to screen 
the focused tetrahydroquinoline chemical library. About 286 
compounds were selected based on the fit value of more than 
4,486 for molecular docking study, which was performed 
to improve the reliability and accuracy of virtual screening. 
The 10 preferable hits based on the best docking score were 
selected for in silico ADMET analysis. The results of in silico 
ADMET study showed that the six hit compounds possessed 
good ADMET properties. These compounds were shifted to 
in vitro pyruvate kinase assay. The results showed that four 
out of the six hit compounds could activate the recombined 
PKM2 with AC 50 values lower than 1.0 jiM. Hence, we 
successfully identified several potent PKM2 activators with 
novel scaffolds via a combination of various in silico tech- 
niques. These compounds had simultaneous binding ability 
with the binding site as well as in vitro pyruvate kinase activa- 
tion. Compound 8 inhibited the proliferation of A549 human 
lung cancer cells and induced cellular apoptosis via the ROS- 
mediated-mitochondrial pathway. Moreover, Compound 8 
could significantly inhibit A549 xenograft tumor growth in 
vivo. Therefore, our studies provided strong evidence that 
PKM2 activators may be a novel lead compound as lung 
cancer drug candidate. 
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Figure S I Superimposition of seven PKM2-activator complexes. 
Abbreviation: PKM, pyruvate kinase isozyme. 




Figure S2 Binding modes and important binding site residues of Compound 8 located in the binding site of PKM2 (ligands in magenta, PKM2 domain in green and cyan, 
respectively). 

Note: The A and B represent two different directional views of Compound 8 located in the binding site of PKM2. 
Abbreviation: PKM, pyruvate kinase isozyme. 
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